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ABSTRACT 


_^A  common  feature  of  the  empirical  data  in  applied  mathematics  is  a  low 
signal  to  noise  ratio.  Defined  loosely  as  the  ratio  of  the  magnitude  of 
essential  signals  to  the  magnitude  of  the  worst  noise,  it  often  runs  between 
1/10  and  1/15,  as  was  the  case  in  the  early  work  on  computed  tomography  with 
digitized  medical  x-ray  films  which  was  the  origin  of  this  project.  In  such 
situations,  straight-forward  linear  smoothing  procedures  tend  to  broaden  the 
noise  peaks  without  reducing  the  magnitude  sufficiently. 


A  common  redeeming  feature  of  the  noise  is  that  the  largest  peaks  are 
quite  sparse,  lesser  peaks  are  more  frequent  but  still  sparse,  etc.  This 
suggests  a  stepwise  smoothing  in  which  only  the  worst  peaks  are  removed  during 
the  early  steps  when  there  is  no  very  good  way  to  remove  them. 

Such  a  procedure,  devised  for  use  with  medical  x-ray  films,  is  described 
in  [1].  In  the  meantime  it  has  been  modified  in  various  ways  and  used  with 
other  data  from  several  sources.  \A  detailed  account  of  experiments  with  the 
procedure  will  be  given  elsewhere.'' — The  purpose  of  the  present  report  is  to 
describe  the  procedure  and  to  show  the  results  of  a  series  of  tests  with  data 
from  a  computed  tomography  x-ray  scan  of  a  defective  battery.^  The  scan  was 
taken  with  a  developmental  version  of  an  industrial  scanner  at  the  General 
.  Electric  Research  and  Development  Center  by  H.  J.  Scudder. 
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ITERATIVE  NOISE  ELIMINATION 


Preliminary  Report 
by 

K.  T.  Smith 


1.  Introduction. 

A  common  feature  of  the  empirical  data  in  applied 
mathematics  is  a  loin  signal  to  noise  ratio.  Defined  loosely 
as  the  ratio  of  the  magnitude  of  essential  signals  to  the 
magnitude  of  the  worst  noise,  it  often  runs  between  1/10  and 
1/15*  as  was  the  case  in  the  early  work  on  computed 
tomography  with  digitized  medical  x-ray  films  which  was  the 
origin  of  this  project.  In  a  recent  coffee  conversation  on 
small  parameter  asymptotics  in  turbulence  theory* 
P.  G.  Saffman  remarked  that  the  only  true  small  parameter  in 
turbulence  theory  is  the  signal  to  noise  ratio.  In  such 
situations*  straightforward  linear  smoothing  procedures  tend 
to  broaden  the  noise  peaks  without  reducing  the  magnitude 
sufficiently. 

A  common  redeeming  feature  of  the  noise  is  that  the 
largest  peaks  are  quite  sparse*  lesser  peaks  are  more 
frequent  but  still  sparse*  etc.  This  suggests  a  stepwise 
smoothing  in  which  only  the  worst  peaks  are  removed  during 
the  early  steps  when  there  is  no  very  good  way  to  remove 
them. 

Such  a  procedure*  devised  for  use  with  medical  x-ray 
films*  is  described  in  C13.  In  the  meantime  it  has  been 
modified  in  various  ways  and  used  with  other  data  from 
several  sources.  A  detailed  account  of  experiments  with  the 
procedure  will  be  given  elsewhere.  The  purpose  of  the 
present  report  is  to  describe  the  procedure  and  to  show  the 
results  of  a  series  of  tests  with  data  from  a  computed 
tomography  x-ray  scan  of  a  defective  battery.  The  scan  was 
taken  with  a  developmental  version  of  an  industrial  scanner 
at  the  General  Electric  Research  and  Development  Center  by 
H.  J.  Scudder. 
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2.  The  battery  and  battery  phantom. 


Tha  battary  was  made  of  caramic  material  with  outer  and 
inner  steal  shells.  A  CT  reconstruction  is  shown  below. 


Battery  Cross  Section 

The  scan  data  consisted  of  256  radiographs  (referred  to 
hereafter  as  x-rays).  each  one  digitized  at  452  points. 
Sample  graphs  are  shown  below.  The  ceramic  interior  is 
rather  dense.  and  the  steel  shells  are  very  dense. 
Consequently,  although  the  data  vary  over  the  range  0-32000. 
the  variations  of  interest  lie  in  the  range  0-4000.  In  all 
graphs  after  the  first  the  data  are  cut  at  4000  so  that 
variations  of  interest  can  be  seen. 

To  test  the  noise  elimination  procedure.  a  battery 
phantom  was  created  mathematical ly.  The  phantom  has  the 
gross  features  of  the  battery,  along  with  explicitly  known 
small  variations.  Noise  was  extracted  from  the  battery 
x-rays  and  added  to  the  phantom  to  produce  noisy  phantoms 
for  the  noise  elimination  procedure,  the  objective  being  to 
test  how  well  the  procedure  could  reproduce  the  correct 
phantom  with  its  sharp  peaks  and  small  variations.  Graphs 
of  the  original  phantom.  two  noisy  phantoms.  and  their 
smoothed  counterparts  are  shown  below.  For  purposes  of 
comparison.  the  original  phantom  is  overlaid  on  the 
reconstructions. 


I 


32  000 


Battery  Phantom 


3.  A  description  of  the  procedure. 

The  iterative  procedure  involves  K  steps.  K  interval 
lenrths  L,.  .  .LK.  and  K  percentage  parameters  p,  .  .  .  .  .  p^. 
In  the  present  examples  these  parameters  have  the  following 
values: 

K  35  7.  L  =  <8.  8.  6.  6,  4.  3,3),  p  *  (.  02.  .  02.  .  04.  .  08.  .  12.  .  24.  1  ) 

The  data  are  given  as  a  numerical  sequence  D  of  length  n. 
At  step  k.  corrections  are  made  only  at  the  Pj,  *n  points 
where  the  noise  is  worst,  and  the  corrected  data  at  the  end 
of  step  k  are  denoted  by  Dfc.  Thus.  0o  =  D. 

At  the  beginning  of  step  k  +  1  there  is  computed  a 
sequence  which  gives  a  measure  of  the  noise  in  at  each 
index  i.  From  the  sequence  there  is  derived  the  number 
M||  such  that  the  condition 

(3.  1)  Nfc<i>  > 

is  satisfied  at  p^*n  indices  i.  Corrections  are  then  made 
at  the  points  satisfying  this  condition.  What  remains  is  to 
specify  how  the  noise  and  the  correction  are 
determined  -  points  that  remain  somewhat  up  in  the  air. 

A  basic  assumption  of  the  method  is  that  the  true, 
noiseless  data  is  smooth,  except  perhaps  for  occasional  jump 
discontinuities.  In  line  with  this  assumption.  "average" 
values  are  computed:  a  left  average  AL^ ( i )  of  Dh  over  the 
interval  of  length  l_n  with  right  end  point  at  i.  a 
corresponding  right  average  AR^(i).  and  a  total  average 
Ak(i)  over  the  inteval  of  length  sL^  +  l  and  center  i.  The 
noise  N^i)  is  then  the  minimum  of  I  Dfc(  i )  -ALfc<  i  )|  . 
ID^  C  i  l-AR^  C  i  )|  .  and  10^1  i  )-Aj^<  i  )|  .  When  a  correction  is 
necessary.  according  to  the  criterion  (3.1).  D^,  ( i )  is 
something  similar  to  A^<i). 

The  "averages"  can  be  formed  in  various  ways,  three  of 
which  are  examined  in  this  report.  In  the  first  AL. AR>  and 
A  are  ordinary  averages.  while  D^+,  <i>  is  an  ordinary 
average  over  points  that  do  not  differ  too  much  from  Afc(i). 
In  the  second  AL. AR»  and  A  are  ordinary  averages  again.  but 
Dji+,  (i)  is  roughly  the  median  of  Dfe  over  the  symmetric 
interval.  In  the  third  the  "averages"  are  all  essentially 
medians.  The  results  in  the  second  case  seem  better  than 
those  in  the  first,  while  the  results  in  the  third  case  seem 
better  than  those  in  the  second.  On  the  other  hand,  the 
averages  are  faster  computati onal ly.  though  this  question 
has  not  received  much  consideration.  It  is  possible  that 
the  procedure  of  cases  2  and  3  should  be  followed  in  the 
earlier  steps.  along  with  more  sophisticated  ones  in  the 
later  steps. 


The  battery  data  considered  in  this  report  do  not 
satisfy  tha  smoothness  condition  that  is  basic  to  the 
method.  The  steel  shells  produce  very  large  sharp  peaks. 
In  such  a  case*  the  noisiest  points  (by  the  criterion  above) 
all  lie  around  the  peaks*  and  the  criterion  gives  a  poor 
measure  of  true  noise.  All  early  corrections  are  made  near 
the  peaks  so  that  the  true  peaks  are  eliminated!  while  the 
true  noise  is  left  largely  unmolested.  In  order  to  avoid 
this#  the  peaks  are  cut  off*  the  resulting  data  is  run 
through  the  procedure*  and  the  peaks  are  then  put  back* 
slightly  smoothed.  Around  the  peaks  this  process  leaves 
some  noise*  which  is  almost  invisible  in  Smoothed  Phantom 
(1*100)  but  is  clearly  visible  in  Smoothed  Phantom  2 
(100*200)*  where  the  initial  noise  is  much  larger.  It  must 
be  expected  that  different  kinds  of  data  will  exhibit 
different  problems  that  can  be  dealt  with  only  on  the  basis 
of  special  information  about  the  data  and  the  use  to  which 
it  will  be  put.  For  example*  the  systematic  errors  around 
the  peaks*  which  cause  no  problem  in  '.he  visual  examination 
of  Smoothed  Phantom  (1* 100)*  might  be  very  damaging  when  put 
into  CT  reconstruction  formulas. 


4.  Smoothing  with  averages. 


In  this  saction  tha  smoothing  is  dona  with  avaragas: 
ALfe<i)  is  tha  average  of  D*  ovar  tha  intarval  of  length  Lj, 
with  right  andpoint  at  i<  AR*(i)  is  tha  corrasponding  right 
hand  average/  and  A^d)  is  tha  avaraga  of  D*  ovar  tha 
intarval  of  langth  2Lk*l  with  cantar  at  i.  Dk+,  (i)  is  tha 
avaraga  of  D*  ovar  thosa  points  in  tha  lattar  intarval 
satisfying  N^ij)  <  with  chosan  so  that  ovarall  80%  of 
tha  points  satisfy  tha  condition.  Tha  first  graphs  balow 
show  x-rays  1/ 100/  and  200  along  with  thair  smoothings. 

Sinca  tha  battary  is  not  axplicitly  known/  tha  graphs 
giva  no  maasura  of  tha  accuracy  of  tha  smoothing  procadure. 
HOwavar/  tha  diffarancas  batwaan  tha  original  and  smoothed 
varsions  sarva  to  provide  noise  to  add  to  tha  known  phantom. 
These  diffarancas  are  shown  in  tha  next  sat  of  graphs. 
Sinca  tha  smoothed  varsions  are  smooth/  and  tha  noise  is 
highly  oscillatory/  tha  differences  include  tha  noise/  but 
they  also  include  artifacts  of  tha  procedure.  In 
particular/  tha  large  noise  peaks  at  the  sites  of  the  large 
steal  peaks  of  tha  battary  are  artifacts. 

Tha  noise  added  to  tha  phantom  comas  from  these 
differences  with  intervals  around  the  steel  peaks  omitted. 
Specifically/  the  Noise  <1/100)  comas  partly  from  x-ray  1 
and  partly  from  x-ray  100/  while  tha  Noise2  (100/200)  comes 
partly  from  x-ray  100  and  partly  from  x-ray  200/  but  is  the 
double  of  the  diffarancas.  (In  many  problems  the  noise  has 
larger  magnitude  than  it  does  hare.  ) 

Tha  next  graphs  below  show  the  noise  (i.e. 
differences)  from  x-rays  1/  100/  and  200/  and  the  two  sets  of 
noise  added  to  the  phantom.  It  is  this  same  noise  that  is 
added  to  tha  phantom  in  the  subsequent  sections.  (Sinca  tha 
medians  appear  to  giva  better  results/  there  might  be  fewer 
artifacts  in  noise  produced  with  medians/  but  it  is  also 
useful  to  have  a  situation  where  tha  noise  is  not  created 
through  the  procedure  itself.  In  future  experiments  the 
noise  will  coma  from  x-rays  of  an  axplicitly  known  phantom.  ) 
Tha  final  graphs  show  tha  noisy  and  smoothed  phantoms  with 
tha  correct  phantom  overlaid  on  tha  lattar. 


X-ray  200 


Smoothed  X-ray  200 
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9.  Smoothing  with  averages  and  msdians. 

In  this  saction  ths  smoothing  is  dons  with  both 
avsragss  and  msdians:  AL^*  AR^*  and  A*  ars  ths  laft*  right* 
and  symmatric  avsragss  dsscribsd  in  saction  4*  whils  D^,  (i) 
is  ths  avsrags  of  ths  thras  middls  tsrms  in  ths 
nondsersasing  rsarrangsmsnt  of  D*  ovsr  ths  intsrval  of 
lsngth  2L|i+1  with  csntsr  at  i.  This  moans  that  ths  points 
whsro  corrsetions  ars  to  bs  mads  ars  idontifisd  using 
avsragss*  and  ths  corrsetions  ars  than  mads  using 
(ssssntially )  msdians. 

Ths  graphs  bslow  show  ths  phantom  and  ths  noiss  at  sach 
stop  of  ths  procsss.  Ths  noiss  is  now  ths  diffsrsnes 
bstwssn  ths  currant  phantom  and  ths  trus  phantom.  Ths 
points  whsrs  corrsetions  ars  mads  (vary  fsw  in  ths  sarly 
stops)  can  bs  idontifisd  by  holding  succsssivs  graphs 
togsthsr  against  a  light.  It  will  bs  obssrvsd  that  thsss 
ars  not  ths  points  whsrs  ths  currant  trus  noiss  is  largsst* 
but  that  thsy  ars  situatsd  nsar  ths  stool  psabs.  In  othsr 
words*  ths  magnituds  of  ths  noiss  as  msasursd  by  ths  procsss 
is  dspsndsnt  on  ths  sotting  -  as  would  bs  sxpsetsd.  This 
dspsndoncs  on  ths  sotting  is  rsducsd  whan  ths  magnituds  is 
msasursd  using  msdians*  as  it  is  in  ths  naxt  saction. 
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Initial  Noise 


Phantom  After  Iteration  1 
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Noise  After  Iteration  1 


Phantom  After  Iteration  2 


Noise  After  Iteration  2 


Phantom  After  Iteration  3 


6.  Smoothing  with  medians. 


In  this  section  ovorything  is  dona  with  medians: 
AL||(i)  is  tho  average  of  tha  two  middla  tarms  in  tha 
nondacraasing  raarrangamant  of  0|^  ovar  tha  intarval  of 
langth  with  right  andpoint  at  i*  AR^ti)  is  tha 
corrasponding  right  hand  median*  and  A^Ci)  and  Djjj.,  (i)  are 
tha  symmetric  median  described  in  tha  last  section. 

As  in  tha  last  section*  tha  graphs  show  tha  phantom  and 
tha  noise  at  each  step  of  tha  process.  It  will  be  observed 
that  tha  points  where  corrections  are  made  are  now  in  batter 
accord  with  tha  points  of  large  noise. 


Phantom  After  Iteration  2 


Phantom  After  Iteration  3 


Noise  After  Iteration  3 


Phantom  After  Iteration  6 


7.  Power  spectra. 

This  sections  contains  graphs  of  the  power  spectra  (the 
square  of  the  absolute  value  of  the  Fourier  transform)  of 
the  noise  at  each  step*  the  noise  being  that  of  the 
average*~median  process  of  section  5.  Because  of  the  sharp 
peaks  the  power  spectrum  of  the  phantom  itself  is  quite 
wild*  so  successive  graphs  would  not  be  instructive. 


Phantom  Power  Spectrum 


8.  Noise  histograms  and  norms. 

The  square  of  the  Lr  norm  of  the  noise  at  step  k 
(i.e.  the  standard  deviation  of  the  phantom  at  step  k  from 
the  true  phantom)  is  as  follows. 

average-median:  5476  5319  5325  4970  4742  3660  2818  3080 
median:  5476  4736  4376  3572  3136  2643  2502  2686 


The  number  used  in  (3.  1)  to  identify  points  where 
corrections  are  made  in  step  k  +  1*  is  obtained  from  a 
histogram  of  the  noise  N^.  Hh(j)  is  the  number  of  points 
i  satisfying  N^ ( i )  >  j-1.  The  histograms  below  come  from 
the  median  smoothing  of  section  6.  The  number  M0»  for 
example  is  computed  as  follows.  Since  p,  =  .  02  and  n  =  452* 
the  number  of  points  to  be  corected  at  step  1  is  .  02*452  = 
9.  The  first  histogram  shows  that  there  are  9  points 
satisfying  N0(i)  >  143.  so  M0  -  143. 

The  first  group  of  histograms  shows  the  noise  as  it  is 
defined  by  the  procedure.  The  second  group  shows  the  true 
noise*  i. e.  the  difference  between  the  phantom  at  the  given 
stage  and  the  true  phantom. 
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9.  Remarks. 


During  the  pest  few  years  variants  of  these  iterative 
noise  elimination  procedures  have  been  used  with  several 
kinds  of  data:  various  medical  radiographs*  neutron 
radiographs  of  nuclear  fuel  assemblies*  ultrasound  CT  breast 
scans*  CT  x-ray  reconstructions*  and  CT  x-ray  scans.  Each 
kind  of  data  has  individual  features  and  special 
difficulties*  and  must  be  treated  in  terms  of  these  features 
and  difficulties  and  the  use  to  which  the  data  is  to  be  put. 

Dominant  features  of  the  present  data  are  the  large* 
sharp  steel  spikes  and  the  large  drop  at  the  edges*  features 
that  required  the  cutoffs  described  in  section  3.  To  some 
extent*  probably  to  a  large  extent*  these  features  are 
responsible  for  the  superiority  of  the  median  smoothing  over 
the  others.  Also*  they  are  responsible  for  the  apparent 
increase  in  the  noise  at  the  last  stage  of  the  procedure* 
which  comes  from  errors  at  a  half  dozen  points  of  very  large 
slope.  It  is  not  clear  whether  anything  should  or  can  be 
done  about  this.  The  test  would  be  to  use  the  battery  data 
in  CT  reconstructions.  This  has  not  been  possible  yet*  but 
will  be  done  in  the  near  future.  (In  their  role  in  the  CT 
reconstruction  formulas*  the  systematic  errors  at  the  peaks 
could  be  much  more  dangerous  than  random  noise.  ) 
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